zation of C. glutamicum genes involved in amino acid biosynthesis,
specifically, encoding hom, thrB, and thrC, and sequences regulating
their expression. Techniques for modifying or replacing these
sequences and means for facilitating further isolations and
characterizations, including promoter probe vectors which are useful
in screening for high efficiency and regulatable promoters and
repressors, are also disclosed. A C. glutamicum genomic library was
constructed by cleaving chromosomal DNA with restriction enzymes,
inserting the DNA fragments into an appropriate vector, and
transforming the resulting recombinant molecules (rDNA) into C.
glutamicum. Amino acid biosynthetic genes hom, thrB, and thrC,
encoding homoserine dehydrogenase, homoserine kinase, and threonine
synthetase, respectively, were isolated by complementation of C.
glutamicum auxotrophs. The hom-thrB genes were subcloned on a 3.6 kb
SalI generated chromosomal fragment while thrC activity was isolated
from a second recombinant plasmid within the genomic library and
subcloned on a 2.7 kb SphI generated fragment. The hom-thrB and thrC
loci, and regulatory sequences, were identified by enzyme assays,
complementation of defined E. coli auxotrophs, S1 nuclease and
deletion mapping.

C. GLUTAMICUM THREONINE BIOSYNTHETIC PATHWAY

Background of the Invention 

The present invention is generally in the field 
of genetic engineering, and specifically, in the 
area of manipulation of amino acid biosynthesis in 
Gram positive bacteria.

Corynebacterium glutamicum is a Gram positive,
nonpathogenic microorganism that has long occupied a
central role in the industrial production of amino
acids by conventional fermentation processes. Past
strain development has primarily depended on classical
mutagenesis to remove competing pathways, thereby
increasing substrate availability, and to remove or
reduce regulatory control of a particular biosynthetic
pathway. Regulatory mutants were isolated by selecting
strains resistant to toxic amino acid analogues. The
use of chemical mutagenesis has been very successful
and a number of economically viable L-amino acid
fermentation strains, such as strains producing
L-glutamate and L-lysine, have been established.

The recent development of cloning vectors,
including those described in U.S. Patent No.
4,649,119 to Sinskey et al., and methods for DNA
transformation of C. glutamicum, as described by
Katsumata et al., J. Bacteriol. 159, 306-311 (1984),
and Yoshihama et al., J. Bacteriol. 162, 591-597
(1985), and the closely related Corynebacterium
(Brevibacterium) lactofermentum described by
Santamaria et al. in J. Gen. Microbiol. 130,
2237-2246 (1984), initiated a new era in the genetic
manipulation of these organisms.

WO 88/09819
PCT/US88/02029

However, the commercial utilization of C.
glutamicum recombinant DNA technologies for future
strain development is dependent on the development
of additional genetic tools and a better understanding
of the fundamental molecular biology of this
species. The use of recombinant DNA techniques to
develop industrial strains would offer several
advantages over classical mutagenic strategies. For
example, specific alterations such as the replacement
of a low efficiency promoter would be possible;
the stepwise isolation of enhancing mutations could
be avoided; regulatory systems could be engineered
to allow the temporal control of gene expression
during a fermentation process; and novel genes
and/or pathways could be introduced into an
organism.

It is therefore an object of the present 
invention to isolate and characterize genes encoding 
components of amino acid biosynthetic pathways in 
Corynebacterium.

It is another object of the present invention 
to clone the isolated amino acid biosynthetic genes, 
specifically those involved in the threonine
biosynthetic pathway.

It is still another object of the present
invention to elucidate the structure of these genes
and the regulatory mechanisms that modulate their
expression.

It is a further object of the present invention 
to characterize and modify the expression of the 
cloned amino acid biosynthetic genes, as well as
the primary structure and regulatory features of 
their protein products. 

Summary of the Invention 

The present invention is a method for the
isolation and characterization of C. glutamicum
genes involved in amino acid biosynthesis,
specifically, hom, thrB, and thrC, and sequences
regulating their expression. Techniques for modifying
their expression and regulation are also described.
Methods and sequences facilitating further
isolations and characterizations are also disclosed,
including promoter probe vectors which are useful in
screening for high efficiency and regulated
promoters.

A C. glutamicum genomic library was constructed
by cleaving chromosomal DNA with the restriction
enzyme MboI, inserting the resultant DNA fragments
into a C. glutamicum/Bacillus subtilis shuttle
vector, pHY416, and transforming the resulting
recombinant molecules into C. glutamicum. Amino
acid biosynthetic genes hom, thrB, and thrC, encoding
homoserine dehydrogenase, homoserine kinase, and
threonine synthase, respectively, were isolated by
complementation of C. glutamicum auxotrophs. The
hom-thrB genes were subcloned on a 3.6 kb SalI
generated chromosomal fragment while thrC activity
was isolated from a second recombinant plasmid
within the genomic library and subcloned on a 2.7 kb
SphI generated fragment. The hom-thrB and thrC loci
were identified by a combination of enzyme assays,
complementation of defined E. coli auxotrophs,
and amino acid sequence homology.

Enzymatic assay of homoserine dehydrogenase
activity, encoded by hom, in strains harboring the
cloned gene demonstrated a 20-fold increase in
specific activity compared to wild type controls.
Both the chromosomal and plasmid encoded activities
are strongly inhibited by L-threonine and repressed
by L-methionine. The L-methionine repression of the
plasmid encoded activity demonstrates that the
structural gene and sequences responsible for its
expression are included within the cloned fragment.
Southern hybridization analysis demonstrated that
the hom/thrB and thrC loci are separated by a
minimum of 8.8 kb in the C. glutamicum chromosome.
This is a different genomic organization from that
observed in E. coli, where the three genes represent
a single operon. Three lines of evidence demonstrate
that the C. glutamicum hom-thrB genes represent
an operon. First, they are located together
(separated by 11 base pairs) and coordinately
regulated by L-methionine. Secondly, Northern
hybridization analysis has identified a single 2.4
kb, L-methionine repressed RNA transcript,
consistent with the size of the two coding regions.
Finally, deletion of the promoter upstream of the
hom gene significantly reduces the expression of
both the hom and thrB genes.

The hom-thrB and thrC promoters were identified
by complementation of auxotrophs, deletion analysis
and S1 nuclease mapping. The hom-thrB operator, a
hyphenated dyad symmetry element, was also identified
by deletion analysis. Methods for modification,
removal or replacement of these regulatory
elements are described.

Brief Description of the Drawings 

Figure 1 is a schematic of the threonine
biosynthetic pathway.

Figure 2 is a graphic depiction of subcloning
strategy and restriction maps of recombinant plasmids
pFS78, pFS80, pFS3.6A, pFS3.6B, pSPC1 and
pSPC4.

Figure 3 is the nucleotide sequence and predicted
protein sequences of hom and thrB.

Figure 4 is the nucleotide sequence and predicted
protein sequence of the thrC gene.

Figure 5a is the sequence of the C. glutamicum
hom-thrB regulatory region indicating the mRNA
initiation site, -35 and -10 regions of thrP1 and
the hyphenated dyad symmetry element responsible for
methionine mediated repression (thrO).

Figure 5b is the potential stem/loop structure
formed by the hyphenated dyad symmetry element.



Figure 6 is a schematic of the construction of
C. glutamicum hom-thrB promoter deletions and
subsequent analysis.

Figure 7: Deletional analysis of the hom-thrB
promoter region. The 3.6 kb C. glutamicum chromosomal
DNA insert of pFS3.6 carrying the hom-thrB
genes is indicated as a hatched box and the nucleotide
sequence of the relevant promoter-containing
DraI-HindIII fragment is shown. The extent of Bal31
generated deletions in various plasmid constructs
based on vector pWST1 is presented as black bars.
The start of transcription as determined by S1
nuclease mapping is indicated by an arrow.

Detailed Description of the Invention

Recombinant DNA technology has been used to
isolate, characterize and manipulate genes involved
in the amino acid biosynthetic pathway of
Corynebacterium glutamicum. The technology and results
obtained aid in the elucidation of the fundamental
molecular biology of C. glutamicum and construction
of amino acid producing strains, particularly
threonine producing strains.

Threonine is produced in a series of reactions
beginning with the reduction of the beta-carboxyl
group of aspartic acid to form the aldehyde,
aspartic beta-semialdehyde, which takes place via an
acyl phosphate intermediate, beta-aspartyl phosphate,
in an ATP requiring reaction. Aspartic
beta-semialdehyde is converted by homoserine
dehydrogenase, encoded by hom, to homoserine.
Homoserine is phosphorylated by homoserine kinase,
encoded by thrB, to homoserine phosphate in an ATP
requiring reaction. The product, homoserine
phosphoric acid, is in turn converted to threonine by
threonine synthase, encoded by thrC, a pyridoxal
phosphate enzyme. Threonine, the end product of the
sequence, is an inhibitory modulator of aspartate
kinase. This reaction pathway is shown in
Figure 1.

Genes encoding the three enzymes, homoserine
dehydrogenase (hom), homoserine kinase (thrB), and
threonine synthetase (thrC), have been isolated,
identified, cloned, and their expression modified as
follows. The cloning and determination of the
nucleotide sequence of these genes provides a means
for manipulating the expression and catalytic
properties of the encoded enzymes. Means for
altering the expression and the end product include
in vitro mutagenesis of the C. glutamicum hom gene
and selection of derivatives resistant to
L-threonine mediated feedback inhibition; sequence
determination of feedback resistant derivatives and
the use of rDNA techniques to combine separate
genetic alterations; determination and modification
of the promoter structure and protein start sites
for hom, thrB, and thrC; increased expression of
hom-thrB via increased promoter efficiency and
removal of L-methionine transcriptional repression;
and molecular joining of the C. glutamicum hom-thrB
and thrC genes to form a hom-thrBC operon.

Isolation of thr- and thr-/met- auxotrophs of C.
glutamicum.

A genetic background in which to isolate the
threonine biosynthetic genes was constructed by
mutating C. glutamicum and isolating auxotrophs
defective in threonine biosynthesis. C.
glutamicum is maintained on LB media (10 g NaCl, 10
g Bactotryptone, 5 g yeast extract, 1 l H2O) or
minimal medium for C. glutamicum (MCG) (10 g
glucose, 7 g (NH4)2SO4, 3 g K2HPO4, 1 g KH2PO4, 0.4 g
MgSO4.7H2O, 2 mg FeSO4.7H2O, 2 mg MnSO4.H2O, 1 mg
biotin, 10 mg thiamine, 2 ml trace elements, 1
l H2O). 1.4% agar was added for plates. The trace
elements contained 44 mg Na2B4O7.7H2O, 20 mg
(NH4)6Mo7O27.4H2O, 5 mg ZnSO4, 135 mg CuSO4.5H2O,
3.6 mg MnCl2.H2O, 435 mg FeCl3 in 500 ml H2O. Where
appropriate, 50 micrograms/ml L-threonine, 50
micrograms/ml L-methionine, 50 micrograms/ml
ampicillin, 15 micrograms/ml kanamycin or
10 micrograms/ml rifampicin were added.

C. glutamicum AS019, a rifampicin resistant
variant of ATCC 13059, was grown at 30°C in LB to
exponential phase (2 x 10^8 cfu/ml), harvested by
centrifugation and resuspended in an equal volume of
minimal media for C. glutamicum (MCG). Cells were
mutagenized by the addition of nitrosoguanidine
(NTG) (40 micrograms/ml) to 1 ml of cells and
incubation without shaking at 30°C for 30 minutes.
Mutagenized cells were harvested by centrifugation,
resuspended in 1 ml LB media and diluted 1:100 into
10 ml aliquots of fresh LB. Following growth at
30°C with shaking to stationary phase, the cells
were diluted and plated on LB agar. Auxotrophs were
screened by replica plating onto MCG plates and
identified by growth patterns on amino acid
pools. Only one strain displaying a particular
auxotrophy such as threonine requirement was saved
from each of the 10 ml aliquots.

Twenty four thr- and six thr-/met- C. glutamicum
auxotrophs were isolated. The thr-/met-
auxotrophs grow on MCG plates supplemented with
homoserine. The thr- auxotrophs may have mutations
in either of the threonine specific enzymes,
homoserine kinase or threonine synthase.

Transformation and complementation of the C.
glutamicum auxotrophs.

Two threonine requiring auxotrophs of C.
glutamicum, AS155 and AS178, were transformed using
the following method. An overnight culture of AS019
was inoculated at a ratio of 1:100 into LB broth
containing 0.2% glucose and 2.0% glycine. The cells
were incubated at 30°C for 15 hours with aeration.
10 ml of cells were harvested by centrifugation and
washed in SMMC buffer (0.5 M sorbitol, 20 mM MgSO4,
20 mM CaCl2, 50 mM Na maleate, pH 7.0). Cells were
resuspended in 2 ml SMMC buffer containing 2.5 mg
lysozyme/ml. The cell suspension was incubated at
37°C with shaking for 90 minutes. Cells were again
harvested by centrifugation at 6000 rpm for ten
minutes and resuspended in three ml SMMC buffer.
0.3 ml aliquots of "protoplasted" cells were placed
in polypropylene tubes. Plasmid DNA in 0.5 M
sorbitol was added. 0.7 ml of 40% PEG, molecular
weight 3350, 50 mM Tris, 20 mM CaCl2, pH 7.4 was
added and gently mixed. 2.0 ml of SB broth (0.5 M
sorbitol, 1 x LB, 20 mM CaCl2, 20 mM MgSO4) was
added to the transformation mixture, which was then
incubated at 30°C without shaking for three hours.
The C. glutamicum protoplasts obtained by growth in
glycine and lysozyme treatment can also be suspended
in SMMC and frozen at -80°C for use in subsequent
transformations.

The transformants were plated out on selective
plates. The two threonine requiring auxotrophs
AS155 and AS178 were transformed with a C.
glutamicum genomic library containing approximately
2.5 genomic equivalents constructed in the C.
glutamicum/B. subtilis chimeric plasmid pHY416,
described by Yoshihama et al., J. Bacteriol. 162,
591-597 (1985) and Follettie and Sinskey in J.
Bacteriol. 166, 695-702 (1986). Kanamycin resistant
transformants were selected and screened for
complementation of the threonine auxotrophy by replica
plating onto MCG/Km plates. Three AS155 transformants
and a single AS178 transformant were
capable of growth without threonine supplementation.
Plasmids were isolated and characterized by restriction
analysis.
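The library is stated to contain approximately 2.5 genomic equivalents. As a rough illustration of what that figure implies, the following sketch estimates the chance that any one locus is represented in the library. It assumes that cloned fragments sample the chromosome randomly and independently (a Poisson idealization that is not claimed in the text); the function name is hypothetical.

```python
import math


def locus_coverage_probability(genome_equivalents: float) -> float:
    """Chance that a given locus is present in at least one clone.

    Assumes clones sample the genome randomly and independently, so the
    number of clones carrying the locus is approximately Poisson distributed
    with mean equal to the number of genome equivalents (an idealization).
    """
    if genome_equivalents < 0:
        raise ValueError("genome_equivalents must be non-negative")
    return 1.0 - math.exp(-genome_equivalents)


if __name__ == "__main__":
    # 2.5 genome equivalents is the library size stated in the text.
    print(f"{locus_coverage_probability(2.5):.3f}")  # prints 0.918
```

Under these assumptions a library of this size would be expected to miss a given gene in roughly one case in twelve, which is one reason a locus might not be recovered from a single screen.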




All four transformants harbored the same
recombinant plasmid, designated pFS78, described in
Figure 2, which contains a 6.8 kb chromosomal DNA
insert. The recombinant plasmid was transformed
into 10 of the independently isolated C. glutamicum
threonine auxotrophs and three auxotrophs requiring
threonine and methionine or homoserine supplementation.
The Km^r transformants were screened for
complementation on MCG/Km plates. The results
demonstrated that pFS78 complements all three
homoserine auxotrophs and four of the ten thr-
auxotrophs, indicating that the plasmid carries the
homoserine dehydrogenase gene, hom, as well as one
of the threonine specific genes, thrB or thrC.

Two of the thr- auxotrophs not complemented by
pFS78, AS148 and AS213, were transformed with the
genomic library and Km^r colonies screened for growth
on MCG/Km plates. Both thr+ AS148 and thr+ AS213
transformants were obtained, and their plasmids
isolated and characterized by restriction analysis.
All thr+ transformants harbor the same 12.5 kb
recombinant plasmid designated pFS80, also shown in
Figure 2, containing a 3.1 kb chromosomal DNA
insert. The chromosomal sequence cloned in pFS80
complements four other thr- auxotrophs not complemented
by pFS78. However, pFS80 was unable to
complement the thr- or thr-/met- strains complemented
by pFS78.

Subcloning and identification by enzyme assay,
complementation of auxotrophs, and amino acid
sequence homology of the C. glutamicum hom, thrB,
and thrC loci.

Deletion analysis of pFS78 indicated that both
the thr- and thr-/met- complementing activities are
located on a 3.6 kb SalI generated chromosomal
fragment. This fragment was purified by agarose gel
electrophoresis and electroelution and ligated into
the unique SalI restriction site of the C.
glutamicum/E. coli chimeric vector pWS124, described
by Batt, Shamnabruch and Sinskey, Biotech. Letts.
7:717 (1985). The recombinant vector, pFS3.6,
complements both AS178 (thr-) and AS253 (thr-/met-).
The plasmid also complements the E. coli thrB
auxotroph, E. coli 5076.

The 2.7 kb SphI generated chromosomal fragment
of pFS80 was purified by agarose gel electrophoresis
and ligated into the unique SphI restriction site of
pUC18. The resulting recombinant plasmids, designated
pSPC1 and pSPC4, also diagrammed in Figure 2,
were able to complement E. coli 5077 (thrC) but not
E. coli 5076 (thrB).

Southern hybridization analysis was used to 
determine the relationship of the hom-thrB and thrC
loci. The results demonstrate that the hom-thrB and
thrC loci in this species are physically separated
by a minimum of 8.8 kb. 

The homoserine dehydrogenase activity in crude
extracts of wild type AS019 was compared to that of
the homoserine auxotroph AS253 with and without the
complementing plasmid pFS3.6 in order to determine
the regulation and extent of overproduction of the
cloned C. glutamicum hom gene product. Methods used
for preparing a C. glutamicum crude extract
preparation and assays for homoserine dehydrogenase,
homoserine kinase and aspartokinase are as follows:

Homoserine dehydrogenase is measured by the
decrease in absorbance at 340 nm due to the oxidation
of NADPH (extinction coefficient = 6220). The
reaction mixture contains: 3 mM DL-aspartate-beta-semialdehyde
(ASA), 0.4 mM NADPH, 0.1 M PO4, pH 7.0,
0.5 M KCl, and enzyme preparation, in a total volume
of 0.7 ml. A blank reaction mixture without DL-ASA
serves as a control. DL-ASA is synthesized by the
ozonolysis of DL-allyl glycine according to the
procedure of Black and Wright, J. Biol. Chem. 213,
39 (1955).
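The conversion from an absorbance change to a specific activity can be sketched as follows. This is a minimal illustration, assuming that the stated extinction coefficient of 6220 is in units of M^-1 cm^-1 and that a 1 cm cuvette is used (neither is stated in the text); the function name and the example readings are hypothetical.

```python
NADPH_EXTINCTION_M_CM = 6220.0  # value from the text; M^-1 cm^-1 is assumed


def hsd_specific_activity(sample_da_per_min, blank_da_per_min, protein_mg,
                          volume_ml=0.7, path_cm=1.0):
    """Return micromoles of NADPH oxidized per minute per mg protein.

    sample_da_per_min: decrease in A340 per minute with DL-ASA present.
    blank_da_per_min:  decrease in A340 per minute in the blank without DL-ASA.
    volume_ml:         reaction volume (0.7 ml in the text).
    path_cm:           cuvette path length (1 cm is an assumption).
    """
    if protein_mg <= 0:
        raise ValueError("protein_mg must be positive")
    net_rate = sample_da_per_min - blank_da_per_min
    # Beer-Lambert law: concentration change (mol/l per min) = dA / (e * l).
    molar_per_min = net_rate / (NADPH_EXTINCTION_M_CM * path_cm)
    umol_per_min = molar_per_min * (volume_ml / 1000.0) * 1e6
    return umol_per_min / protein_mg


if __name__ == "__main__":
    # Hypothetical readings: 0.0722 A/min in the assay, 0.0100 A/min in the blank,
    # 0.01 mg protein in the cuvette.
    print(f"{hsd_specific_activity(0.0722, 0.0100, 0.01):.2f}")  # prints 0.70
```

Fold changes such as those reported for the cloned gene are then simply ratios of specific activities computed in this way for each extract.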

Homoserine kinase activity was determined by a
coupled enzyme assay which measured the reaction
product ADP. The reaction mixture contained 3.3 mM
ATP, 0.45 mM NADH, 4.5 mM phosphoenolpyruvate, 1.0 mM
L-homoserine, 10 mM MgCl2, 12.5 units pyruvate kinase
(Sigma, St. Louis, MO), 25 units lactate
dehydrogenase (Sigma), 0.25 M KCl, 100 mM HEPES buffer
(pH 7.8) and enzyme preparation in a total volume of
1.0 ml. The reaction was monitored by the decrease
in absorbance at 340 nm due to the oxidation of
NADH. The absorbance decrease in the absence of
added substrate, L-homoserine, was determined and
subtracted from values obtained with the complete
assay mixture.

Aspartate kinase activity, inhibited by
threonine, is determined by measuring the
aspartohydroxamate produced according to the
procedure of Black and Wright, J. Biol. Chem. 213, 27
(1955). Protein in the crude extract is precipitated
by adding 5 volumes of saturated ammonium
sulfate and resuspended in 0.3 volume of buffer
containing 0.1 M Tris, pH 7.4, 0.2 M KCl. The assay
mixture contains: 0.1 M Tris, pH 7.4, 10 mM ATP, 10
mM MgSO4, 0.6 M hydroxylamine (pH 7.4), 0.6 M (NH4)2,
50 mM L-aspartate and enzyme preparation in a total
volume of 1 ml. After 1 hr incubation at 37°C, the
reaction was stopped by the addition of 1.5 ml of
solution containing 10% FeCl3.6H2O, 3.3% trichloroacetic
acid and 0.7 N HCl. After centrifugation,
aspartohydroxamate concentration is measured by
absorption at 540 nm (extinction coefficient = 600).
A blank reaction mixture without L-aspartic acid
serves as a control.
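Because the aspartate kinase assay is an end-point measurement, the amount of product is obtained from a single absorbance reading rather than from a rate. The following sketch illustrates the arithmetic, assuming that the extinction coefficient of 600 is in M^-1 cm^-1, that a 1 cm cuvette is used, and that the absorbance is read in the full 2.5 ml obtained after adding the stop solution (these points are assumptions; the function names and example readings are hypothetical).

```python
HYDROXAMATE_EXTINCTION_M_CM = 600.0  # value from the text; M^-1 cm^-1 is assumed


def aspartohydroxamate_umol(sample_a540, blank_a540,
                            assay_ml=1.0, stop_ml=1.5, path_cm=1.0):
    """Micromoles of aspartohydroxamate formed in one assay tube.

    The blank (no L-aspartic acid) is subtracted, the net absorbance is
    converted to a molar concentration, and the concentration is multiplied
    by the final volume after addition of the stop solution.
    """
    net = sample_a540 - blank_a540
    molar = net / (HYDROXAMATE_EXTINCTION_M_CM * path_cm)
    final_volume_l = (assay_ml + stop_ml) / 1000.0
    return molar * final_volume_l * 1e6


def aspartokinase_rate(sample_a540, blank_a540, minutes=60.0):
    """Average micromoles of product formed per minute over the incubation."""
    return aspartohydroxamate_umol(sample_a540, blank_a540) / minutes


if __name__ == "__main__":
    # Hypothetical readings: A540 of 0.30 for the assay and 0.06 for the blank.
    print(f"{aspartohydroxamate_umol(0.30, 0.06):.2f}")  # prints 1.00
```

Dividing the rate by the milligrams of protein in the tube gives a specific activity comparable between extracts.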

Protein concentration of the crude extracts is 
determined using the Bio-Rad protein assay with 
bovine serum albumin standards (BioRad Laboratories, 
Richmond, CA).

The C. glutamicum thr-/met- strain AS253
harboring the parental vector pWS124 had less than
2.5% of the homoserine dehydrogenase activity
present in the wild type AS019. Introduction of the
cloned C. glutamicum hom gene present on pFS3.6A
into C. glutamicum AS253 leads to a twenty-fold
increase in the specific activity of homoserine
dehydrogenase over that observed in wild type C.
glutamicum AS019. The orientation of the cloned hom
gene with respect to the vector affected its expression.
Crude extracts of AS253 harboring pFS3.6B
demonstrated an 11-fold increase in homoserine
dehydrogenase activity relative to wild type.

The level of aspartokinase in C. glutamicum
AS019 harboring either the parental vector pWS124 or
the recombinant vector pFS3.6 was unchanged over
that observed in the controls. Further, the
aspartokinase specific activity was not repressed by
growth in MCG supplemented with 2.7 mM L-methionine.
The differential transcriptional control of
homoserine dehydrogenase, in combination with the lack
of increased aspartokinase activity in cells harboring
pFS3.6 (hom-thrB), demonstrates that the two
activities are not catalyzed by a bifunctional
protein as in E. coli. The expression of the
encoded homoserine dehydrogenase is repressed 3.2
fold by the addition of 2.7 mM L-methionine.
Expression of the C. glutamicum thrA gene is also
repressed by L-methionine, demonstrating that the
expression of the pFS3.6 encoded hom gene is mediated
by its native promoter/operator. Expression of
the cloned C. glutamicum thrB gene was similarly
repressed 2.6-fold by 2.7 mM L-methionine.

The activity of the homoserine dehydrogenase, 
both chromosomal and plasmid encoded, is inhibited 
by the addition of L-threonine to the assay mixture.
Addition of 1 mM D-threonine or L-methionine does
not affect the homoserine dehydrogenase activity. 

The complete nucleotide sequence of 3704 bp is
shown in Figure 3 for hom and thrB. Two long open
reading frames (ORFs) extend from position 907 to
2329 and from 2312 to 3269. The protein sequences
of homoserine dehydrogenase and homoserine kinase
are predicted on the basis of the sequence extending
from the first potential translation initiation
codon, either ATG or GTG (position 994), to the TAA
stop at position 2330 for ORF1 and from the ATG at
2342 to the TAG stop at 3269. The predicted proteins
have molecular weights of 46,436 and 32,618 daltons
for ORF1 and ORF2, respectively. A translation
terminator is present at position 3279 to 3311,
seven nucleotides downstream of the TAG stop codon.
This is shown in further detail in Figure 5. The
sequence forms a strong stem-loop structure having a
stem length of 15 bp and a seven base loop similar
to the rho-independent terminators from E. coli.
The sequence 5' to ORF1 has a region strongly rich
in A:T containing the hom-thrB promoter and site of
action of the methionine mediated repression.
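Stem-loop structures of the kind described, like the hyphenated dyad symmetry element of the operator, consist of two inverted-repeat arms separated by a short spacer. The following is a minimal sketch of how such an element may be located in a sequence. It requires perfectly complementary arms, which is a simplification, and the test sequence is invented for illustration and is not taken from Figure 3; all function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_stem_loops(seq: str, stem_len: int, loop_len: int):
    """List perfect stem-loop candidates as (start, left_arm, loop, right_arm).

    start is the 0-based index of the first base of the left arm. A hit is
    reported when the right arm is the exact reverse complement of the left
    arm; mismatches and G:U pairs are ignored in this sketch.
    """
    seq = seq.upper()
    span = 2 * stem_len + loop_len
    hits = []
    for i in range(len(seq) - span + 1):
        left = seq[i:i + stem_len]
        loop = seq[i + stem_len:i + stem_len + loop_len]
        right = seq[i + stem_len + loop_len:i + span]
        if right == reverse_complement(left):
            hits.append((i, left, loop, right))
    return hits


if __name__ == "__main__":
    # Invented toy sequence: a 5 bp stem with a 7 base loop, flanked by TT.
    toy = "TT" + "GCCGA" + "AAAAAAA" + "TCGGC" + "TT"
    print(find_stem_loops(toy, stem_len=5, loop_len=7))
    # prints [(2, 'GCCGA', 'AAAAAAA', 'TCGGC')]
```

For the terminator described above, the corresponding call would use a stem length of 15 and a loop length of 7 on the sequence of Figure 3.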

The DNA sequence of the chromosomal DNA insert
in pFS80, encoding threonine synthase (thrC), was
also determined by dideoxy sequencing techniques and
is shown in Figure 4. A restriction map was predicted
and checked against restriction analysis
results to corroborate the accuracy of the sequence
data. Computer aided analysis (UWGCG Programs, UW
Biotechnology Center, University of Wisconsin) was
used to predict the thrC gene within the sequence
data. These results were compared with in vivo
genetic deletion analysis. An open reading frame
extends 5' to the GTG (at 396) marking the amino terminal
region of thrC.
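Predicting a restriction map from sequence data amounts to searching the sequence for each enzyme's recognition site and comparing the predicted positions with the fragments observed on a gel. A minimal sketch follows. The recognition sequences are the standard ones for the named enzymes; the test sequence is invented and is not taken from Figure 4, the function names are hypothetical, and reported positions are those of the first base of each site rather than the exact cut position.

```python
# Standard recognition sequences for four enzymes named in the text.
RECOGNITION_SITES = {
    "SalI": "GTCGAC",
    "SphI": "GCATGC",
    "HindIII": "AAGCTT",
    "SmaI": "CCCGGG",
}


def site_positions(seq: str, site: str):
    """Return 1-based positions of the first base of every occurrence of site."""
    seq = seq.upper()
    positions = []
    index = seq.find(site)
    while index != -1:
        positions.append(index + 1)
        index = seq.find(site, index + 1)  # step by one to allow overlaps
    return positions


def predicted_map(seq: str):
    """Map each enzyme name to the list of predicted site positions."""
    return {name: site_positions(seq, site)
            for name, site in RECOGNITION_SITES.items()}


if __name__ == "__main__":
    # Invented toy sequence containing two SalI sites and one SphI site.
    toy = "AA" + "GTCGAC" + "TTTT" + "GCATGC" + "AA" + "GTCGAC" + "CC"
    print(predicted_map(toy))
    # prints {'SalI': [3, 21], 'SphI': [13], 'HindIII': [], 'SmaI': []}
```

All four sites listed are palindromic, so searching one strand is sufficient; a non-palindromic site would also require a search of the reverse complement.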

The threonine synthase activity maps within the
1.57 kb BclI-StuI restriction fragment. The
computer predicted structural gene sequence, GTG
(396) to TAA (1881), lies completely within this
fragment. The StuI restriction site is 176 bp 3' of the
predicted translation stop codon.
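The stated coordinates permit a simple consistency check of the predicted reading frame. The sketch below assumes that 396 and 1881 refer to the first base of the GTG start codon and the first base of the TAA stop codon respectively (an assumption about the numbering convention), and uses the common rule of thumb of roughly 110 daltons per amino acid residue, which is not a figure from the text. The function names are hypothetical.

```python
def orf_codon_count(start_first_base: int, stop_first_base: int) -> int:
    """Number of sense codons between a start codon and an in-frame stop codon.

    Both arguments are 1-based positions of the first base of the codon.
    Raises ValueError if the stop codon is not in frame with the start.
    """
    coding_nt = stop_first_base - start_first_base
    if coding_nt <= 0 or coding_nt % 3 != 0:
        raise ValueError("stop codon is not downstream and in frame")
    return coding_nt // 3


def approximate_mass_daltons(codons: int, daltons_per_residue: float = 110.0):
    """Very rough protein mass estimate from the residue count."""
    return codons * daltons_per_residue


if __name__ == "__main__":
    codons = orf_codon_count(396, 1881)
    print(codons)  # prints 495
    print(approximate_mass_daltons(codons))  # prints 54450.0
```

Under the assumed numbering the GTG (396) to TAA (1881) interval is an exact multiple of three, as expected for an open reading frame, and the 495 codons accommodate the carboxy terminal residues 430 to 480 referred to in the homology comparison.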

Heterospecific genetic complementation of the
E. coli thrC 1001 auxotroph shows that the C.
glutamicum thrC gene is expressed in E. coli. By
comparison, using computer searches for regions
similar to E. coli ribosome binding sites and
translation terminator sequences, a ribosome binding
site adjacent to GTG (396) and a significant
terminator-like sequence 35 bp 3' of the TAA at 1881 were
identified. Homology was detected between C.
glutamicum and E. coli thrC regions at both DNA and
predicted protein sequence levels. Limited
conservation of DNA sequence was observed between
the E. coli thrC gene and the region 400 to 1400 bp
of the C. glutamicum thrC sequence. There is
consistent conservation in the central region
(residues 100 to 350 of C. glutamicum thrC) and the
carboxy terminal residues 430 to 480.

Identification of the hom-thrB transcription start
site by S1 nuclease mapping and deletion analysis.

The transcriptional start site for the C.
glutamicum hom-thrB genes was identified using S1
nuclease mapping, as described by Berk and Sharp,
Cell 12, 721 (1977). The procedure requires the
isolation and denaturation of a DNA fragment which
overlaps the promoter and has been labelled at the
5' end of the antisense strand. Hybridization of
this fragment to its cognate mRNA and subsequent
digestion with the single strand specific
nuclease S1 results in the degradation of the 3' end
of the labelled DNA fragment up to the point at which
it is protected by the RNA. The size of the resulting
DNA fragment is determined by comigration with
DNA fragments resulting from the sequencing
reactions of Maxam and Gilbert, Methods in Enzymol. 65,
499-559 (1982). This enables the identification of
the transcriptional start site. The results can
then be confirmed by deletion analysis of the
promoter using restriction enzymes and exonuclease
Bal31 to construct a series of deletions which are
then reinserted into the organism and assayed for
activity.
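The arithmetic underlying the mapping can be sketched as follows. Because the probe is labelled at the 5' end of the antisense strand, the length of the protected fragment is the distance from the labelled nucleotide back to the first transcribed nucleotide. The coordinates in the example are invented for illustration and are not the mapped values; the function name is hypothetical, and sequence positions are assumed to be numbered along the sense strand.

```python
def transcription_start(labelled_end_pos: int, protected_len: int,
                        probe_len: int) -> int:
    """Sense-strand coordinate of the transcription start site.

    labelled_end_pos: sense-strand coordinate paired with the labelled
                      5' nucleotide of the antisense probe.
    protected_len:    length of the S1-resistant fragment, read against
                      the sequencing ladder.
    probe_len:        full length of the labelled probe.

    A fragment as long as the probe means transcription starts upstream of
    the probe, so the start site cannot be assigned from that probe.
    """
    if not 0 < protected_len < probe_len:
        raise ValueError("protected fragment must be shorter than the probe")
    return labelled_end_pos - protected_len + 1


if __name__ == "__main__":
    # Invented example: probe of 242 nt labelled opposite position 1000,
    # protected fragment of 100 nt.
    print(transcription_start(1000, 100, 242))  # prints 901
```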

The SmaI-HindIII restriction fragment that
encompasses the hom-thrB promoter/operator and the
first seven amino acid residues of the hom gene
product was used in the S1 nuclease mapping studies.
Plasmid pRA1 (pUC18 containing the 3.6 kb SalI C.
glutamicum genomic fragment encoding hom-thrB) was
cut with HindIII to generate a 1.014 kb restriction
fragment, dephosphorylated with CIP (calf intestine
phosphatase, Boehringer-Mannheim Biochemicals,
Indianapolis, IN), labelled by treatment with
polynucleotide kinase and gamma-32P-ATP (specific
activity greater than 5000 Ci/mmol, Amersham Corp.,
Arlington Heights, IL), subsequently cleaved with
SmaI (New England Biolabs, Beverly, MA) to produce a
242 bp DNA fragment that was then purified by
preparative polyacrylamide gel electrophoresis. All
manipulations were carried out in accordance with
procedures described in Molecular Cloning by T.
Maniatis et al. (Cold Spring Harbor Laboratory, Cold
Spring Harbor, NY, 1982) and the enzyme suppliers'
recommendations.

Understanding the expression of a gene requires
the isolation and structural characterization of the
mRNA product, the size and number of transcripts,
the regulatory control and the site of transcription
initiation. Criteria evaluated for RNA isolated
from Corynebacteria include RNA quality (Abs260:
Abs280, ratio of about 1.95 to 2.05), purity
(degradation and contamination determined by agarose
gel electrophoresis), and yield in mg RNA/liter
cells (Abs260 - 4 micrograms RNA/ml).
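The quality and yield criteria can be expressed as a short calculation. In the sketch below the acceptance window of 1.95 to 2.05 is taken from the text, while the factor converting absorbance at 260 nm to RNA concentration is deliberately left as a required argument because the value in the text is not clearly legible; the figure of 40 used in the example is the commonly quoted conversion for RNA and is an assumption. The function names and readings are hypothetical.

```python
def rna_quality_ok(a260: float, a280: float,
                   low: float = 1.95, high: float = 2.05) -> bool:
    """True if the A260:A280 ratio falls within the accepted window."""
    if a280 <= 0:
        raise ValueError("a280 must be positive")
    return low <= a260 / a280 <= high


def rna_concentration_ug_per_ml(a260: float, ug_per_ml_per_a260: float,
                                dilution_factor: float = 1.0) -> float:
    """RNA concentration of the undiluted sample in micrograms per ml.

    ug_per_ml_per_a260 is the concentration corresponding to one absorbance
    unit at 260 nm; it must be supplied by the caller.
    """
    return a260 * ug_per_ml_per_a260 * dilution_factor


if __name__ == "__main__":
    # Hypothetical readings on a 1:10 dilution of the resuspended pellet.
    print(rna_quality_ok(0.50, 0.25))  # prints True (ratio 2.0)
    print(rna_concentration_ug_per_ml(0.50, 40.0, dilution_factor=10.0))
    # prints 200.0 (assumes 40 micrograms/ml per absorbance unit)
```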

RNA was extracted from C. glutamicum AS019
using the guanidinium isothiocyanate/French press
isolation method. In this method, a 1 l LB culture
of AS019 is grown at 30°C to late exponential phase
and harvested by 10 minutes centrifugation at 5000
rpm in a Sorvall GSA-250 rotor. The cells are
washed at 4°C in 0.1 M NaCl, 10 mM Tris.Cl, pH 8.0,
1 mM EDTA, and harvested again. The pellets are
combined in 50 ml 4 M guanidinium isothiocyanate,
2-mercaptoethanol (GuT:2ME) and immediately lysed by
compression through a French press at approximately
1500 psi. Cell debris is sedimented by centrifugation
in a Sorvall SS-34 rotor for 10 minutes at
10,000 rpm. Six ml aliquots of the supernatant are
applied to 4 ml of 5.7 M CsCl, 10 mM EDTA, 25 mM
sodium acetate and centrifuged at 34,000 rpm in a
Beckman Ti50 fixed angle rotor for 24 hours.

The density gradient separates the sheared DNA 
molecules from the RNA, which forms a pellet at the 
tube base. This RNA pellet is resuspended in 5 ml 
10 mM Tris, pH 7.5, 1 mM EDTA 5.0% Sarkosyl (TESK) 
containing 5.0% phenol* The solution is made 0.1 M 
with 5 M NaCl, and extracted with 10 ml 50% phenol, 
49% chloroform, 1% isoamylalcohol (PCIA) . The 
phases are separated by centrifugation in a Sorvall 
SS-34 at 3,000 rpm for 5 minutes and the phenolic 
phase back extracted with TESK containing 0.1 M 
NaCl. The combined aqueous phases are made 0.2 M 
with sodium acetate, pH 5.5, and the RNA precipi- 
tated overnight at -20 °C in 2.5 volumes of ethanol. 
After centrifugation at 10,000 rpm for 20 minutes at 
4°C, the RNA pellet is washed in ethanol, dried 
under vacuum, and resuspended in RNase free water at 
a concentration of 0.5 mg/ml. * 
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Total cellular RNA, isolated from AS019 grown 
in minimal media with and without L-methionine (400 
microgram/ml) supplementation, was separated by 
agarose gel electrophoresis, transferred to nitro- 
cellulose paper and probed either with pMF-L2 or 
pUC-B5. Plasmid pMF-L2 contains a 1.8 kb Nae l 
fragment which spans both the horn and thr B genes but 
contains no flanking sequences. RNA is glyoxylated 
to prevent spurious electrophoretic patterns caused 
by potential secondary structure. For each lane, 20 
micrograms of C. qlutamicum RNA is suspended in 8 
microliters of glyoxal reaction mixture (1 M 
glyoxal, 50% DMSO, 10 mM potassium phosphate, pH 
7.0) and incubated 1 hour at 50 °C. Glyoxylated RNA 
samples are prepared for loading by the addition of 
17 microliters formamide, 6.2 microliters formaldehyde, 
3 microliters 10x running buffer (0.2 M 
morpholinopropanesulfonic acid (MOPS), 50 mM sodium 
acetate, 10 mM EDTA) and 5 microliters loading dye 
(50% glycerol, 1 mM EDTA, 0.4% bromophenol blue, 
0.4% xylene cyanol). Samples are loaded onto an 
agarose/formaldehyde gel (2.2% agarose, 1 x running 
buffer, 18% formaldehyde, pH adjusted to 7.0 with 
NaOH) and electrophoresed at 30 mA. HindIII 
restricted lambda DNA is labeled with 32P, denatured 
and glyoxylated similar to RNA samples and utilized 
as a molecular size standard. Following electro- 
phoresis, nucleic acid is transferred to a nitro- 
cellulose filter using the technique of Southern, J. 
Mol. Biol. 98, 503-517 (1975) except that no prior 
treatment of the gel was necessary. Following 
transfer for 15 hours, the filters are baked in 
vacuo at 80 °C for 2 hours. 

The filters are prehybridized in sealed plastic 
bags for 16 hours at 42 °C in a minimum volume, 
approximately 10 ml, of hybridization buffer (50% 
deionized formamide, 5 x SSC, 50 mM sodium acetate, 
pH 6.5, 25 micrograms sonicated denatured salmon 
sperm DNA, 0.02% bovine serum albumin, 0.02% Ficoll, 
0.02% polyvinyl pyrrolidone) (SSC = 0.15 M NaCl, 
0.015 M sodium citrate, pH 7.0). The DNA probe is 
labeled with 32P by nick translation, according to 
Rigby et al., J. Mol. Biol. 113, 237-251 (1977), 
heat denatured, added to the hybridization buffer, 
and incubated with the filter for 20 hours at 42 °C. 
Filters are subsequently washed five times in 2 x 
SSC/0.1% SDS at room temperature and then three 
times in 0.2 x SSC at 50 °C. After drying, the filters 
are exposed to X-ray film and specific bands of 
hybridization determined by autoradiography. 
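
The wash and hybridization buffers above are all stated as multiples of SSC. As a minimal sketch, assuming only the 1 x SSC composition given in the text (0.15 M NaCl, 0.015 M sodium citrate), the molarities at each strength can be worked out as follows. The function name `ssc_molarity` is illustrative and not part of the original procedure.

```python
# Composition of 1 x SSC as defined in the text (molar concentrations).
SSC_1X = {"NaCl": 0.15, "sodium citrate": 0.015}


def ssc_molarity(fold):
    """Return the molar concentration of each SSC component at `fold` x strength."""
    if fold <= 0:
        raise ValueError("fold strength must be positive")
    # Round to suppress floating point noise in the printed values.
    return {name: round(conc * fold, 6) for name, conc in SSC_1X.items()}


# Strengths used in the procedure: 5 x (hybridization), 2 x and 0.2 x (washes).
for fold in (5, 2, 0.2):
    print(f"{fold} x SSC:", ssc_molarity(fold))
```

The final 0.2 x SSC wash at 50 °C is the lowest salt step, and so the most stringent one in the series.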

Hybridization of pMF-L2 to total C. glutamicum 
RNA leads to the appearance of a single 2.4 kb 
transcript. This observation is in agreement with 
the predicted size of the hom-thrB transcript (2408 
base pairs), based on S1 nuclease mapping, and the 
computer predicted termination site of the thrB 
gene. The size of the observed transcript and the 
lack of a detectable second transcript hybridizing 
to the hom-thrB probe lead to the conclusion that C. 
glutamicum expresses hom and thrB from a single 
transcriptional unit, representing the first defined 
operon in this organism. Results obtained by S1 
nuclease mapping of the thrA-thrB junction support 
this conclusion. 

Hybridization of hom-thrB specific RNA to the 
242 bp SmaI-HindIII end-labeled probe was achieved 
by lyophilizing 30 micrograms C. glutamicum RNA with 
10 ng probe DNA and resuspending in 10 microliters 
hybridization buffer (40 mM PIPES pH 6.4, 0.4 M 
NaCl, 1 mM EDTA, 80% deionized formamide). The DNA 
was denatured by heating at 90 °C for 10 minutes. 
Hybridization was performed overnight at 49 °C. 

The hybrid DNA-RNA molecules were digested with 
2,000 units S1 nuclease (Bethesda Research Labora- 
tories, Inc., Gaithersburg, MD) in 235 microliters 
assay buffer (250 mM NaCl, 30 mM sodium acetate, 10 
mM zinc sulphate, 200 microgram/ml calf thymus DNA). 
The digest was incubated at 37 °C for one hour and 
terminated by extraction with 250 microliters PCIA 
(phenol/chloroform/isoamyl alcohol, 50:48:2). 
Nucleic acids were precipitated from the aqueous 
phase with 0.2 M sodium acetate, 2 micrograms yeast 
tRNA and 50 microliters ethanol at -20 °C. Following 
centrifugation and drying, each sample was dissolved 
in 3 microliters formamide loading buffer (100 ml 
formamide, 0.72 g Na2EDTA, 0.03 g bromophenol blue) 
and applied to a 6% polyacrylamide/7 M urea sequenc- 
ing gel. The 242 bp SmaI-HindIII restriction 
fragment was sequenced using the procedures for the 
G, C+T and C reactions as described in Maxam and 
Gilbert, Methods in Enzymol. 65, 499-559 (1982). 

The SmaI-HindIII fragment labeled at the 
HindIII 5' terminus (antisense strand) acts as a 
specific probe to RNA complementary to this region. 
To detect the start of the hom-thrB mRNA transcript, 
total RNA is hybridized to the labeled probe and the 
unprotected single stranded nucleic acid digested 
with the single strand specific S1 nuclease. The 
length of the protected region of the DNA probe is 
resolved on a denaturing 6% polyacrylamide gel. The 
hom-thrB transcript initiates at coordinate 906, at 
the first of a GG doublet. This nucleotide is 88 bp 
5' of the first available ATG codon in the hom open 
reading frame. This defines the promoter region 
responsible for hom-thrB expression, which is 
designated thrP1 for threonine promoter 1. The 
sequence is shown in Figure 5a. 
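
The 88 bp spacing between the transcription start and the hom start codon can be cross-checked against the sequence in Figure 3. The sketch below is illustrative only. It assumes that the Figure 3 row containing the start codon begins at coordinate 961, and that the OCR-damaged "CAIGACC" in that row reads "CATGACC", which matches the Met-Thr-Ser translation printed above it. The helper `first_atg_coordinate` is a hypothetical name.

```python
# Figure 3 row assumed to start at coordinate 961 (OCR "CAIGACC" read as "CATGACC").
ROW_START = 961
ROW = "ACAACGCTCGACCGCGATTATTTTTGGAGAATCATGACCTCAGCATCTGCCCCAAGCTTT"

# Transcription start coordinate reported from the S1 nuclease mapping.
TRANSCRIPT_START = 906


def first_atg_coordinate(row, row_start):
    """Return the 1-based sequence coordinate of the A in the first ATG of `row`."""
    index = row.find("ATG")
    if index < 0:
        raise ValueError("no ATG found in row")
    return row_start + index


atg = first_atg_coordinate(ROW, ROW_START)
leader_length = atg - TRANSCRIPT_START
print("ATG coordinate:", atg)
print("distance from transcription start (bp):", leader_length)
```

Under these assumptions the ATG falls at coordinate 994, which is 88 bp downstream of coordinate 906, in agreement with the spacing stated in the text.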

No degradation was detected of the DNA probe from 
the 0.46 kb FokI-PvuII restriction fragment span- 
ning the hom-thrB PvuII junction, indicating that 
the majority of the thrB expression was mediated by 
thrP1. 

Identification and deletion of the operator 
mediating L-methionine repression of hom-thrB 
expression. 

In addition to promoter identification, 
restriction and/or exonuclease Bal31 deletions have 
been utilized in identification and deletion of the 
operator (thrO), which mediates the L-methionine 
repression of hom-thrB expression, and in the con- 
struction of a feedback inhibition deficient variant 
of the hom gene. These studies were facilitated by 
construction of a special vector designated pWST1. 
When investigating promoter structure and function 
on a plasmid, it is desirable to eliminate read 
through transcription from upstream promoters 
located within the cloning vector. Plasmid pWST1 
contains the E. coli trpA terminator followed by a 
polylinker to facilitate the cloning of the gene in 
the various deletion generated variants. The effect 
of the deletions can be assayed in the absence of 
influence by upstream promoters. This vector is 
applicable not only to the analysis of the hom-thrB 
genes, but also to the characterization of other 
promoter/operator systems in C. glutamicum. pWST1 
is constructed using the trpA terminator obtained 
from Pharmacia Fine Chemicals, Piscataway, NJ. SacI 
linkers are attached and the trpA terminator in- 
serted into the polylinker region of M13mp19. The 
constructs are sequenced using the method of Sanger 
et al., Proc. Natl. Acad. Sci. USA 74, 5463-5467 
(1977), to screen for insertion of the terminator in 
the proper orientation. The terminator/polylinker 
is subsequently ligated into the SmaI-SalI re- 
stricted pTF33, a derivative of the C. glutamicum/E. 
coli shuttle vector pWS124, described by Batt et 
al., Biotechnol. Letts. 7:717 (1985). DNA linkers 
and enzymes are obtained from New England Biolabs or 
Boehringer-Mannheim, as noted earlier. 

The hom-thrB genes are ligated into SmaI/SalI re- 
stricted pWST1 on a 2,911 bp SmaI-SalI restriction 
fragment and a 2,815 bp DraI-SalI restriction frag- 
ment, designated pWFS2.9 and pWFS2.8, respectively. 
Further deletion of the hom-thrB upstream region is 
accomplished as diagrammed in Figure 6. The recombi- 
nant vector pWFS2.9 was linearized by SmaI digestion 
and deletions constructed by digestion of 6 micro- 
grams of DNA with the exonuclease Bal31 (0.2 units/ 
microgram DNA). Aliquots of the reaction mixture 
were removed at 30 second intervals between 4 and 15 
minutes, and the reaction stopped by dilution into 
one volume of 50 mM EDTA. The DNA was digested with 
SalI and the resulting hom-thrB containing fragments 
purified by agarose gel electrophoresis. These 
fragments were ligated into SmaI-SalI digested pWST1 
and the resulting recombinant mixture used to 
transform C. glutamicum AS253 (hom). 

The extent of the Bal31 generated deletions, 
diagrammed in Fig. 7, in complementing and non- 
complementing derivative plasmids is determined by 
nucleotide sequence analysis and measurement of the 
levels of homoserine dehydrogenase activity in crude 
extracts. The ability of the deletion plasmids to 
complement the hom-thrB auxotrophy of strain AS253 
was checked by streaking the corresponding AS253 
transformants onto MCG/kanamycin agar plates. 

The results of the deletion construction and 
their effect on hom gene expression show that 
deletion of sequences upstream of hom, up to the 
SmaI (pWFS2.9) or DraI (pWFS2.8) restriction site 
(218 and 124 bp prior to the predicted translational 
start site, respectively) does not drastically 
influence the expression of the hom gene. 

As predicted from the S1 nuclease studies, the 
DraI-SalI hom-thrB fragment contains both the -10 
and -35 regions which are critical for promoter 
activity in E. coli, as reported by Hawley and 
McClure, Nucleic Acids Res. 11:2237 (1983). Further 
deletion by Bal31 markedly reduces the expression 
of the hom gene product homoserine dehydrogenase, 
supporting the data obtained with S1 nuclease 
mapping of thrP1. Two of the deletion derivatives, 
pWFSdelta2304 and pWFSdelta2207, are able to comple- 
ment C. glutamicum AS253 despite the reduction in 
hom expression to 2 and 10%, respectively, of that 
observed in strains containing the parental plasmid 
pFS3.6. The relative specific activity of homo- 
serine dehydrogenase observed in C. glutamicum AS253 
(hom) harboring these two deletion derivatives is 
3.1 and 0.7 with respect to that observed in wild 
type strains. C. glutamicum auxotrophs requiring 
threonine/methionine express approximately 2% of the 
wild type level of homoserine dehydrogenase activ- 
ity. The deletion of the hom promoter carried in 
pWFSdelta2431 results in a 96-fold decrease in the 
expression of the cloned thrB gene, thus demonstrat- 
ing a common promoter. 

The Bal31 generated deletions enable the 
mapping of the boundary between those deletion 
derivatives which complement the threonine/ 
methionine auxotrophy and those that fail to comple- 
ment, between 63 base pairs (pWFSdelta2304) and 56 
base pairs (pWFSdelta2431) upstream of the predicted 
hom translation start site. Deletions extending 
beyond the putative start point of transcription 
(88 bp upstream of the hom start codon), as 
determined by S1 nuclease mapping, do not 
necessarily result in the complete loss of 
homoserine dehydrogenase activity. The residual 
activity may be due to weak promoter activity 
adjacent to the main transcription start site. 

The mechanism of transcriptional regulation of 
the hom-thrB operon was determined to involve 
control by the single stem/loop attenuator shown in 
Figure 5. Specific deletion of the stem/loop 
structure removes the methionine repression of 
hom-thrB expression. In this structure, the se- 
quence ATGTAG, encoding Met-Stop, forms the loop. 
The sequence TTTTGGACA, similar to the TTGGAGA that 
precedes the predicted translational start site of 
the hom gene, precedes the ATG and thus represents a 
potential ribosome binding site. A possible model 
is one in which a bound ribosome can be momentarily 
stalled due to a low concentration of charged 
methionine tRNA, thus preventing stem/loop formation 
and allowing transcription to continue. At higher 
concentrations of methionine, the ribosome would 
move to the TAG Stop signal and disengage, allowing 
stem/loop formation. 

LacZ protein fusions can be used to evaluate 
promoter and operator functions directly in C. 
glutamicum. This method is demonstrated using the 
hom-thrB promoter/operator, isolated on a 330 bp 
SmaI-HindIII fragment purified and ligated into 
similarly digested pSKS107 that contains a 
promoterless lactose operon. The construction 
creates a hom-lacZ protein fusion containing the 
N-terminal eight hom amino acid residues preceding 
the lacZ gene product beta-galactosidase. The 
expression of the fusion protein products required 
the insertion of a ribosome binding site, initiating 
codon (ATG/GTG), under the control of the hom-thrB 
promoter/operator. The recombinant vector was 
introduced into E. coli JM83 where beta- 
galactosidase activity was observed in crude ex- 
tracts. Supplementation of the growth medium with 
L-methionine represses the expression of lacZ 
two-fold. 

Deletion of a portion of the dyad symmetry 
element required for operator function demonstrates 
the role of the dyad symmetry element in the regula- 
tion of the hom-thrB gene. A 1.48 kb KpnI 
restriction fragment containing the hom gene of the 
Bal31 deletion derivative, pWFSdelta2207, was 
purified and used to replace its counterpart in the 
parental vector pFS3.6A. The resulting recombinant 
plasmid, designated pWFS2207delta1, contains a 
specific 10 base pair deletion removing the left 
half of the dyad symmetry element. Identical levels 
of homoserine dehydrogenase activity were measured 
in strains grown in MCG medium with and without 
methionine supplementation. This demonstrates that 
the dyad symmetry element is the site of 
L-methionine repression (thrO). 
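
A dyad symmetry element is a pair of sequence arms in which one arm is the reverse complement of the other, so the transcript can fold into a stem/loop. The sketch below shows one simple way to locate such arms. The `REGION` string is transcribed from the Figure 3 row that begins at coordinate 901 and may carry OCR damage. The function names and the `arm_len` and `min_loop` parameters are illustrative assumptions, not part of the original analysis, and the search is not a folding prediction.

```python
# Translation table for complementing DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_dyad(seq, arm_len, min_loop=3):
    """List (left_start, right_start, left_arm) for arms able to pair as a stem.

    Positions are 0-based offsets into `seq`. The right arm must start at
    least `min_loop` bases after the end of the left arm.
    """
    hits = []
    for i in range(len(seq) - arm_len + 1):
        arm = seq[i:i + arm_len]
        partner = reverse_complement(arm)
        j = seq.find(partner, i + arm_len + min_loop)
        if j >= 0:
            hits.append((i, j, arm))
    return hits


# Assumed reading of the Figure 3 sequence around the ATGTAG (Met-Stop) motif.
REGION = "TTTTTGTTTTGCACACATGTAGGGTGGCCGAAACAAAG"

for left, right, arm in find_dyad(REGION, arm_len=7):
    print(left, right, arm, reverse_complement(arm))
```

With a 7 base arm, the search pairs TTTGTTT upstream of the ATGTAG motif with AAACAAA downstream of it, which is consistent with the left and right halves of the element described in the text.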

Identification of the thrC promoter by deletion 
analysis, complementation of auxotrophs and 
overproduction of the enzyme. 

The promoter sequence for expression of thrC 
was also determined by deletion analysis, 
auxotrophic complementation, and overproduction of 
enzyme. 

The overproduction of the product of the thrC 
gene, threonine synthetase, can be measured from 
crude extracts of C. glutamicum strains AS213 and 
wild type AS019 containing the parental vector 
pHY416 or the thrC containing pFS80. The two 
strains containing the plasmids are grown in MCG 
medium, the cells harvested and lysed, cell debris 
removed by centrifugation, and the protein purified 
by 40 to 60% ammonium sulphate fractionation, DEAE- 
Sephadex column chromatography with a 0.2 M 
to 0.6 M KCl gradient, and anion exchange 
chromatography in an FPLC column eluted with a 0.1 M 
to 0.7 M KCl gradient. The results demonstrate that 
the protein is produced at a level 200% of that 
observed in the wild type. The method produces 
threonine synthetase specific activity demonstrating 
a purification of over 350-fold. The protein has a 
molecular weight of 56,000 by SDS polyacrylamide gel 
electrophoresis. 

Analysis of activity complementation of the 
thrC auxotroph indicates that the promoter for the 
thrC gene sequence precedes the predicted transla- 
tion start site for the thrC gene product by 
approximately 80 base pairs. The sequence, TTGAAA - 
(16 bp) - TAGGGT, is closely related to the E. coli 
consensus sequence as well as the promoter sequence 
determined for C. glutamicum thrP1, AAAGCA - 18 bp - 
TATAGT. Identification of the thrC gene promoter 
sequence is confirmed by S1 nuclease analysis. 
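
The similarity of the two C. glutamicum promoters to the E. coli consensus can be made concrete by counting matching positions in each hexamer. This is an illustrative sketch. It takes the E. coli consensus to be TTGACA (-35) and TATAAT (-10), which the text does not spell out, and the `matches` helper is a hypothetical name.

```python
# E. coli consensus hexamers (assumed here; not stated explicitly in the text).
ECOLI_MINUS35 = "TTGACA"
ECOLI_MINUS10 = "TATAAT"

# (-35 hexamer, spacer length in bp, -10 hexamer) as quoted in the text.
PROMOTERS = {
    "thrC": ("TTGAAA", 16, "TAGGGT"),
    "thrP1": ("AAAGCA", 18, "TATAGT"),
}


def matches(hexamer, consensus):
    """Count positions at which `hexamer` agrees with `consensus`."""
    if len(hexamer) != len(consensus):
        raise ValueError("sequences must have equal length")
    return sum(a == b for a, b in zip(hexamer, consensus))


for name, (minus35, spacer, minus10) in PROMOTERS.items():
    print(
        name,
        f"-35: {matches(minus35, ECOLI_MINUS35)}/6",
        f"spacer: {spacer} bp",
        f"-10: {matches(minus10, ECOLI_MINUS10)}/6",
    )
```

On this count the thrC promoter agrees best with the consensus in its -35 hexamer, while thrP1 agrees best in its -10 hexamer.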

Modification of the enzyme structure and expression 
of hom, thrB and thrC. 

Once the hom, thrB and thrC genes are identi- 
fied, including the identification of the initiation 
sites of both mRNA and protein synthesis for the 
genes, it is possible to increase the quantity of 
gene expression by increasing the gene dosage by 
localization of specific genes on a multicopy 
plasmid, by site-directed mutagenesis or replacement 
of the promoter, by increasing translational effi- 
ciency through alteration of the ribosome binding 
site, or by increasing stability of the protein by 
site-directed mutagenesis. The quality of the 
particular gene product can be increased using in 
vitro and site-directed mutagenesis to alter substrate 
utilization as well as the kinetic and regulatory 
properties of the enzyme. Physical properties such 
as heat stability can also be modified. The 
construction of a vector with the three genes 
transcribed as a single unit under the control of a 
high efficiency promoter results in more efficient 
threonine synthesis. One can also remove the 
L-threonine feedback inhibition of homoserine 
dehydrogenase or the threonine and lysine inhibition 
of aspartokinase to bring about the overproduction of 
threonine. The feedback inhibition of homoserine 
dehydrogenase can be removed by in vitro mutagenesis 
using hydroxylamine and/or sodium bisulfite, 
methods well known to those skilled in the art, or 
by recombinant techniques. The mutagenized plasmids 
are reintroduced into C. glutamicum and screened for 
AHV resistance or by enzyme assays. Increased 
promoter efficiency can also be accomplished by site 
directed mutagenesis of the existing promoter or by 
replacement with a high efficiency promoter. 

The thrC gene can be placed under the trans- 
criptional control of high efficiency promoters such 
as the E. coli promoter tac to produce elevated 
levels of the gene product. The expression vector 
pKK233-2, obtained from Pharmacia Fine Chemicals, is 
restricted with NcoI-HindIII. The plasmid pFS80 is 
cleaved with BclI, blunt ended with Klenow 
polymerase, and NcoI linkers ligated onto the flush 
ends. The ligation product is double digested with 
NcoI and HindIII, a 2.8 kb fragment purified and 
ligated into similarly digested pKK233-2. The 
resulting recombinant vector designated pKC14 is 
transformed into E. coli JM105. The requirement for 
IPTG (isopropyl-beta-D-thio-galactopyranoside) 
induction demonstrates that the thrC gene is under 
the transcriptional control of the tac promoter. 
The threonine synthetase activity measured in the 
absence of IPTG was 1.3 nmole/min/mg protein. The 
addition of 2 mM IPTG induces 24 times the threonine 
synthetase activity (30.9 nmole/min/mg protein). 
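
The 24-fold figure follows directly from the two specific activities reported above. A minimal sketch of the arithmetic, using a hypothetical helper named `fold_induction`:

```python
def fold_induction(induced, uninduced):
    """Return the ratio of induced to uninduced specific activity."""
    if uninduced <= 0:
        raise ValueError("uninduced activity must be positive")
    return induced / uninduced


# Threonine synthetase specific activities from the text (nmole/min/mg protein).
UNINDUCED = 1.3   # without IPTG
INDUCED = 30.9    # with 2 mM IPTG

ratio = fold_induction(INDUCED, UNINDUCED)
print(f"fold induction: {ratio:.1f}")
```

The ratio is a little under 24, which the text rounds to 24 times.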

The homoserine dehydrogenase and homoserine 
kinase polypeptides encoded by the open reading 
frames corresponding to the hom and thrB gene 
products can be expressed and purified for analysis. 
The enzymes are purified from 10 liters of MCG broth 
from a CHEMAP fermenter inoculated with a 300 ml 
overnight culture of C. glutamicum AS019/pFS3.6 
grown for 24 hours at 30 °C with 470 rpm agitation. 
Cells are harvested by ultrafiltration and centri- 
fugation, the cell pellet resuspended in lysis 
buffer (100 mM KPO4, pH 7.0, 0.5 M KCl) and the 
cells lysed by repeated passage through a French 
pressure cell. Debris is removed by centrifugation, 
the supernatant precipitated with ammonium sulphate, 
and the enzyme activities separated on a DEAE- 
Sephadex A-50 column eluted with a linear 0.3 M to 
0.8 M KCl gradient. Fractions containing the 
appropriate enzyme activity are pooled and the 
proteins analyzed by SDS polyacrylamide electro- 
phoresis. The proteins are then further purified on 
hydroxylapatite HPLC prior to final separation by 
preparative SDS-PAGE. The purified homoserine 
dehydrogenase remains active through the procedure 
and has a final molecular weight of 47,000 daltons. 
The activity of the homoserine kinase is lost, but 
the protein has a molecular weight of 32,000. 
The observed molecular weights are in close 
agreement with the molecular weights predicted from 
the nucleotide sequences. The NH2 terminus of the 
homoserine dehydrogenase is blocked; however, the 
amino acid composition is in good agreement with the 
amino acid composition predicted from the gene 
sequence. The expression of activity by ligation of 
the SmaI-HindIII fragment containing the C. 
glutamicum thrP1 and predicted NH2-terminal seven 
amino acid residues of the hom gene product indicates 
that the N-terminal sequence is correct. The first 
ten residues of the thrB gene product, homoserine 
kinase, are in complete agreement with the predicted 
amino acid sequence. This identifies the 
translation initiation site for the thrB gene as the 
ATG at nucleotide 2342, confirming the predicted 
primary structure of the C. glutamicum homoserine 
kinase. The protein appears to undergo post- 
translational removal of the N-formyl-Met, a rela- 
tively common feature of procaryotic proteins. 

Construction and application of a C. glutamicum 
promoter probe. 

A promoter probe for use in identifying, 
isolating, and quantifying the efficiency of 
promoters is based on pWST1 and designated pAL-1. 
The chloramphenicol acetyl transferase (CAT) gene is 
used as the promoter probe gene. The cat gene is 
expressed in E. coli, B. subtilis, and C. 
glutamicum. Since cloning of strong promoters can 
induce plasmid instability by transcriptional 
interference at the replication origin, it may be 
necessary to clone the fd enteric bacteriophage 
major gene terminator at the 3' terminus of the test 
gene. Potential promoter sequences can then be 
screened for the acquisition of chloramphenicol 
resistance. Preliminary estimation of promoter 
efficiency can be accomplished by determining the 
extent of antibiotic resistance. 

A number of promoter sources can be screened 
for their efficiency in C. glutamicum. High effi- 
ciency promoters from other procaryotic systems are 
known, for example, E. coli trp, the hybrid P tac, 
lambda pH, and B. subtilis Preg. Random C. gluta- 
micum chromosomal DNA and/or corynephage DNA frag- 
ments can also be inserted into the polylinker site, 
upstream of the test gene, and cat activity deter- 
mined to assess promoter efficiency. DNA sequencing 
and determination of the transcriptional start sites 
are used to characterize the promoter structure. 
For example, the 266 bp DraI-HaeIII fragment which 
spans the predicted thrC promoter region was 
purified and ligated into the SmaI restriction site 
of the pAL-1 polylinker. The resulting recombinant 
mixture was introduced into E. coli and 
chloramphenicol resistant transformants obtained. 

The primary site of metabolic regulation of 
threonine biosynthesis in C. glutamicum is the 
L-threonine inhibition of homoserine dehydrogenase. 
To remove the metabolic block, in vitro mutagenesis 
has been used to alter the hom gene product to 
produce feedback inhibition deficient variants. The 
recombinant plasmid pWFSdelta2207 was used as a 
source of the hom gene. This plasmid expresses a 
lower level of homoserine dehydrogenase (10%) than 
pFS3.6A, eliminating potential artifacts due to the 
overproduction of the hom gene product. Plasmid DNA 
was digested with KpnI, separated by agarose gel 
electrophoresis and the fragments isolated by 
electroelution into dialysis bags. The 1.43 kb KpnI 
fragment containing the hom gene was purified and 
treated with hydroxylamine. This is a potent 
mutagen primarily causing AT to GC and GC to AT 
transitions. The mutated hom gene was isolated and 
3 micrograms of target DNA resuspended in 280 
microliters of 1 M hydroxylamine, 0.3 M KPO4, pH 
6.0, aliquots removed at between 10 and 300 minutes 
and the reaction stopped by ethanol precipitation. 
The mutagenized fragments were religated to the 
large, 12.7 kb, KpnI restriction fragment and 
transformed into E. coli JM83. 

Plasmid DNA from ampicillin resistant trans- 
formants was purified on CsCl gradients and trans- 
formed into the restriction deficient C. glutamicum 
AS019-E12. These transformants were then screened 
for resistance to alpha-aminohydroxyvaleric acid 
(AHV) in order to select for a deregulated hom gene 
product. Homoserine dehydrogenase activity assays 
were used to confirm and demonstrate the removal of 
the L-threonine mediated feedback inhibition. 
Different mutations could be combined by recombinant 
DNA techniques to determine the extent to which they 
are cooperative. 

The present invention, nucleotide sequences 
encoding threonine biosynthetic enzymes, and methods 
and sequences for the expression and regulation of 
expression of these enzyme encoding sequences are 
disclosed. Modifications and variations of this 
invention will be obvious to those skilled in the 
art of genetic engineering from the foregoing 
detailed description. It is intended that these 
modifications and variations will fall within the 
scope of the appended claims. 

WE CLAIM: 

1. A method for the production of threonine 
comprising: 

providing nucleotide sequences for the 
genes encoding the enzymes in the threonine 
biosynthetic pathway; and 

inserting the enzyme encoding sequences 
with selected nucleotide sequences mediating 
the expression and regulation of the enzyme 
encoding sequences into an expression vector. 

2. The method of claim 1 further comprising 
inserting said expression vector into an 
expression host. 

3. The method of claim 2 wherein said expression 
host is Corynebacterium. 

4. The method of claim 1 wherein said nucleotide 
expression and regulation sequences include a 
promoter, further comprising selecting a 
promoter having a higher efficiency than the 
promoter associated with the enzyme encoding 
chromosomal genes and selecting for greater 
efficiency. 

5. The method of claim 4 wherein said higher 
efficiency promoter is obtained by mutating the 
promoter associated with said enzyme encoding 
genes. 

6. The method of claim 4 further comprising 
selecting said high efficiency promoter from 
the group of promoters isolated from 
Escherichia, Bacillus, Staphylococcus and 
Streptococcus. 

7. The method of claim 1 further comprising 
selecting a multicopy plasmid as the expression 
vector. 

8. The method of claim 1 wherein said expression 
and regulation sequences include a ribosome 
binding site, further comprising selecting for 
a ribosome binding site with increased 
efficiency. 

9. The method of claim 1 further comprising 
combining said enzyme encoding nucleotide 
sequences in a single expression vector. 

10. The method of claim 1 further comprising 
mutating said enzyme encoding nucleotide 
sequences and selecting for temperature 
stability. 

11. The method of claim 1 further comprising 
mutating said enzyme encoding nucleotide 
sequences and selecting for substrate utilization. 

12. The method of claim 1 wherein said expression 
and regulation signals include a repressor, 
further comprising modifying said repressor. 

13. The method of claim 12 wherein said repressor 
is deleted. 

14. The method of claim 12 wherein said repressor 
is mutated. 

15. The method of claim 12 wherein said repressor 
is replaced with a repressor other than the 
repressor associated with the chromosomal gene 
encoding said sequence. 

16. A method of constructing a promoter probe for 
Corynebacterium amino acid genes comprising: 
isolating nucleotide sequences encoding a 
detectable protein product involved in 
threonine biosynthesis in a Corynebacterium 
host, 

constructing deletions of the 3' end of said 
nucleotide sequences, 

inserting said deletions into an expression 
vector, 

transforming said vector into an auxotrophic 
Corynebacterium host, and 

determining if said deletions produce protein 
in auxotrophic Corynebacterium hosts. 

17. The method of claim 16 further comprising 
sequencing the nucleotide sequences comple- 
menting the auxotrophic Corynebacterium hosts. 

18. A rDNA sequence comprising hom, thrB, and thrC. 

19. The rDNA sequence of claim 18 further 
comprising a promoter sequence. 

20. The rDNA sequence of claim 19 further 
comprising a repressor sequence. 

21. The rDNA sequence of claim 19 wherein said 
promoter sequence is selected from sequences 
having a higher translational efficiency than 
the sequences associated with the chromosomal 
DNA. 

22. A Corynebacterium rDNA promoter sequence 
comprising a ribosome binding site TTGGAGA. 

23. A nucleotide sequence hybridizing to a rDNA 
sequence encoding homoserine dehydrogenase in 
Corynebacteria. 

24. A nucleotide sequence hybridizing to a rDNA 
sequence encoding homoserine kinase in Coryne- 
bacteria. 

25. A nucleotide sequence hybridizing to a rDNA 
sequence encoding threonine synthase. 

26. A Corynebacterium rDNA translation termination 
sequence comprising 

AAGGAAGGCCCCTTCGAATCAA-GA 
AGGGGCCTT. 

27. A Corynebacterium rDNA translation termination 
sequence comprising 

G,ATGGAACCAG GCCTTTCGCATTG 
AGTGGCGTTTTAAGGCCTCCA. 

28. A Corynebacterium rDNA sequence repressing 
translation in the presence of excess 
methionine comprising 

T T T G T T. T T G G A C A C A T G T T C TAG G 

met stop 

GTGGCCGAAACAAA. 

FIGURE 3 

1 GTCGACCGCGTGAAGTCGCCCTTTAGGAGAATTCTGACTAACTGGAGCCAAAACTTGATC 60 

61 CACTCGAGAGCTGTGCAGTCTCTTTTTCCTTCAATTCTGCCTGCTCGAGCTCGTAGAAGT 120 

121 AGAGGTCTACTTCAGTTGGTTCACCTTGCACACAAGCATGAAGTAGTGGGTAGGTCGAGT 180 

181 TGTTAAATGCGGTGTAGAAGGGGAGTAGTTCGCTAGCAAAGGTTAATTTGGAGTCGCTGT 240 

241 ACTGCGGGTTCTCGGGTGGAGTATTCCCGGAGGATTCAAGAAATCTTGACGCATCTTTGA 300 

301 TGAGGTATGTTTGGAATTCGTCGGCACCTTCCTCGCCGGAGAGGTAGTAGGAGTTGTCGT 360 

361 AATTTGGAACCCAGATGGCAAATCGTGCGTTTTCGATTGCGTCCAGGACTTCCTCTACGT 420 

421 TGTATCTCGCACTTGTTGCAGCGGAAGCGACTCGGTTGCCGATGTCTCCGTATGCAGTGA 480 

481 GCGTGGCGTTTCCGAGGGGAACTTGATCAGAGGAATACACCATGGAGCCGATGTCAGAGG 540 

541 CGACTGCGGGCAGATCCTTTTGAAGCTGTTTCACAATTTCTTTGCCCAGTTCGCGGGGGA 600 

601 TCTGGAACCACTTTTGCATGCGATCGTCGTCAGAGTGGTTCATGTGAAAAATACACTCAC 660 

661 CATCTCAATGGTCATGGTGAAGGCCTGTACTGGCTGCGACAGCATGGAACTCAGTGCAAT 720 

721 GGCTGTAAGGCCTGCACCAACAATGATTGAGCGAAGCTCCAAAATGTCCTCCCCGGGTTG 780 

781 ATATTAGATTTCATAAATATACTAAAAATCTTGAGAGTTTTTCCGTTGAAAACTAAAAAG 840 

841 CTGGGAAGGTGAATCGAATTTCGGGGCTTTAAAGCAAAAATGAACAGCTTGGTCTATAGT 900 



901 GGCTAGGTACCCTTTTTGTTTTGCACACATGTAGGGTGGCCGAAACAAAGTAATAFFACA 960 

                                     MetThrSerAlaSerAlaProSerPhe 
961 ACAACGCTCGACCGCGATTATTTTTGGAGAATCATGACCTCAGCATCTGCCCCAAGCTTT 1020 
                                     Translation Initiation Codon 

     AsnProGlyLysGlyProGlySerAlaValGlyIleAlaLeuLeuGlyPheGlyThrVal 
1021 AACCCCGGCAAGGGTCCCGGCTCAGCAGTCGGAATTGCCCTTTTAGGATTCGGAACAGTC 1080 

     GlyThrGluValMetArgLeuMetThrGluTyrGlyAspGluLeuAlaHisArgIleGly 
1081 GGCACTGAGGTGATGCGTCTGATGACCGAGTACGGTGATGAACTTGCGCACCGCATTGGT 1140 

     GlyProLeuGluValArgGlyIleAlaValSerAspIleSerLysProArgGluGlyVal 
1141 GGCCCACTGGAGGTTCGTGGCATTGCTGTTTCTGATATCTCAAAGCCACGTGAAGGCGTT 1200 

     AlaProGluLeuLeuThrGluAspAlaPheAlaLeuIleGluArgGluAspValAspIle 
1201 GCACCTGAGCTGCTCACTGAGGACGCTTTTGCACTCATCGAGCGCGAGGATGTTGACATC 1260 

     ValValGluValIleGlyGlyIleGluTyrProArgGluValValLeuAlaAlaLeuLys 
1261 GTCGTTGAGGTTATCGGCGGCATTGAGTACCCACGTGAGGTAGTTCTCGCAGCTCTGAAG 1320 

     AlaGlyLysSerValValThrAlaAsnLysAlaLeuValAlaAlaHisSerAlaGluLeu 
1321 GCCGGCAAGTCTGTTGTTACCGCCAATAAGGCTCTTGTTGCAGCTCACTCTGCTGAGCTT 1380 



AlaAspAlaAlaGluAlaAiaAsnValAspLeuTyrPheGluAlaAlaValAlaGlyAla 
1381 GCTGATGCAGCGGAAGCCGCAAACGTTGACCTGTACTTCGAGGCTGCTGTTGCAGGCGCA 1440 
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FIGURE 3 (CONT'D) 



IleProValValGlyProLeuArgArgSerLeuAlaGlyAspGlnlleGinSerValMet 
1441 ATTCCAGTGG1TGGCCCACTGCGTCGCTCCCTGGCTGGCGATCAGATCCAGTCTGTGATG 1500 

GlylleValAsnGlyThrThrAsnPhelleLeuAspAlaMetAspSerThrGlyAlaAsp 
1501 GGCATCGTTAACGGCACCACCAACTTCATCTTGGACGCCATGGAITCCACCGGCGCTGAC 1560 

TyrAlaA8pSerLeuAlaGluAlaThrArgLeuGlyTyrAlaGluAlaAspProThrAla 
1561 TATGCAGATTCTTTGGCTGAGGCAACTCGTTTGGGTTACGCCGAAGCTGATCCAACTGCA 1620 

AspValGluGlyHisAspAlaAlaSerLysAlaAlaIleLeuAlaSerIleAlaPheHis
1621 GACGTCGAAGGCCATGACGCCGCATCCAAGGCTGCAATTTTGGCATCCATCGCTTTCCAC 1680 

ThrArgValThrAlaAspAspValTyrCysGluGlyIleSerAsnIleSerAlaAlaAsp
1681 ACCCGTGTTACCGCGGATGATGTGTACTGCGAAGGTATCAGCAACATCAGCGCTGCCGAC 1740 

IleGluAlaAlaGlnGlnAlaGlyHisThrIleLysLeuLeuAlaIleCysGluLysPhe
1741 ATTGAGGCAGCACAGCAGGCAGGCCACACCATCAAGTTGTTGGCCATCTGTGAGAAGTTC 1800 

ThrAsnLysGluGlyLysSerAlaIleSerAlaArgValHisProThrLeuLeuProVal
1801 ACCAACAAGGAAGGAAAGTCGGCTATTTCTGCTCGCGTGCACCCGACTCTATTACCTGTG 1860

SerHisProLeuAlaSerValAsnLysSerPheAsnAlaIlePheValGluAlaGluAla
1861 TCCCACCCACTGGCGTCGGTAAACAAGTCCTTTAATGCAATCTTTGTTGAAGCAGAAGCA 1920 

AlaGlyArgLeuMetPheTyrGlyAsnGlyAlaGlyGlyAlaProThrAlaSerAlaVal
1921 GCTGGTCGCCTGATGTTCTACGGAAACGGTGCAGGTGGCGCGCCAACCGCGTCTGCTGTC 1980 

LeuGlyAspValValGlyAlaAlaArgAsnLysValHisGlyGlyArgAlaProGlyGlu 
1981 CTTGGCGACGTCGTTGGTGCCGCACGAAACAAGGTGCACGGTGGCCCTGCTCCAGGTGAG 2040 



SerThrTyrAlaAsnLeuProIleAlaAspPheGlyGluThrThrThrArgTyrHisLeu 
2041 TCCACCTACGCTAACCTGCCGATCGCTGATTTCGGTGAGACCACCACTCGTTACCACCTC 2100

AspMetAspValGlnAspArgValGlyValLeuAlaGluLeuAlaSerLeuPheSerGlu 
2101 GACATGGATGTGGAAGATCGCGTGGGGGTTTTGGCTGAATTGGCTAGCCTGTTCTCTGAG 2160 



GlnGlyIleSerLeuArgThrIleArgGlnGluGluArgAspAspAspAlaArgLeuIle
2161 CAAGGAATCTCCCTGCGTACAATCCGACAGGAAGAGCGCGATGATGATGCACGTCTGATC 2220 



ValValThrHisSerAlaLeuGluSerAspLeuSerArgThrValGluLeuLeuLysAla
2221 GTGGTCACCCACTCTGCGCTGGAATCTGATCTTTCCCGCACCGTTGAACTGCTGAAGGCT 2280 



LysProValValLysAlaIleAsnSerValIleArgLeuGluArgAsp
2281 AAGCCTGTTGTXAAGGCAATCAACAGTGTGATCCGCCTCGAAAGGGACTAATXTTACTGA 2340 

Stop 



Predicted start of thrB translation

MetAlaIleGluLeuAsnValGlyArgLysValThrValThrValProGlySerSerAl
2341 CATGGCAATTGAACTGAACGTCGGTCGTAAGGTTACCGTCACGGTACCTGGATCTTCTGC 2400
Translation initiation Codon 



aAsnLeuGlyProGlyPheAspThrLeuGlyLeuAlaLeuSerValTyrAspThrValGl
2401 AAACCTCGGACCTGGCTTTGACACTTTAGGTTTGGCACTGTCGGTATACGACACTGTCGA 2460



uValGluIleIleProSerGlyLeuGluValGluValPheGlyGluGlyGlnGlyGluVa
2461 AGTGGAAATTATTCCATCTGGCTTGGAAGTGGAAGTTTTTGGCGAAGGCCAAGGCGAAGT 2520



lProLeuAspGlySerHisLeuValValLysAlaIleArgAlaGlyLeuLysAlaAlaAs
2521 CCCTCTTGATGGCTCCCACCTGGTGGTTAAAGCTATTCGTGCTGGCCTGAAGGCAGCTGA 2580 



pAlaGluValProGlyLeuArgValValCysHisAsnAsnIleProGlnSerArgGlyLe
2581 CGCTGAAGTTCCTGGATTGCGAGTGGTGTGCCACAACAACATTCCGCAGTCTCGTGGTCT 2640 



uGlySerSerAlaAlaAlaAlaValAlaGlyValAlaAlaAlaAsnGlyLeuAlaAspPh
2641 TGGCTCCTCTGCTGCAGCGGCGGTTGCTGGTGTTGCTGCAGCTAATGGTTTGGCGGATTT 2700 



eProLeuThrGlnGluGlnIleValGlnLeuSerSerAlaPheGluGlyHisProAspAs
2701 CCCGCTGACTCAAGAGCAGATTGTTCAGTTGTCCTCTGCCTTTGAAGGCCACCCAGATAA 2760 

nAlaAlaAlaSerValLeuGlyGlyAlaValValSerTrpThrAsnLeuSerIleAspGl
2761 TGCTGCGGCTTCTGTGCTGGGTGGAGCAGTGGTGTCGTGGACAAATCTGTCTATCGACGG 2820 



yLysSerGlnProGlnTyrAlaAlaValProLeuGluValGlnAspAsnIleArgAlaTh
2821 CAAGAGCCAGCCACAGTATGCTGCTGTACCACTTGAGGTGCAGGACAATATTCGTGCGAC 2880 



rAlaLeuValProAsnPheHisAlaSerThrGluAlaValArgArgValLeuProThrGl
2881 TGCGCTGGTTCCTAATTTCCACGCATCCACCGAAGCTGTGCGCCGAGTCCTTCCCACTGA 2940 



uValThrHisIleAspAlaArgPheAsnValSerArgValAlaValMetIleValAlaLe
2941 AGTCACTCACATCGATGCGCGATTTAACGTGTCCCGCGTTGCAGTGATGATCGTTGCGTT 3000



uGlnGlnArgProAspLeuLeuTrpGluGlyThrArgAspArgLeuHisGlnProTyrAr 
3001 GCAGCAGCGTCGTGATTTGCTGTGGGAGGGTACTCGTGACCGTCTGCACCAGCCTTATCG 3060



gAlaGluValLeuProIleThrSerGluTrpValAsnArgLeuArgAsnArgGlyTyrAl
3061 TGCAGAAGTGTTGCCTATTACCTCTGAGTGGGTAAACCGCCTGCGCAACCGTGGCTACGC 3120 



aAlaTyrLeuSerGlyAlaGlyProThrAlaMetValLeuSerThrGluProIleProAs
3121 GGCATACCTTTCCGGTGCCGGCCCAACCGCCATGGTGCTGTCCACTGAGCCAATTCCAGA 3180 



pLysValLeuGluAspAlaArgGluSerGlyIleLysValLeuGluLeuGluValAlaGl
3181 CAAGGTTTTGGAAGATGCTCGTGAGTCTGGCATTAAGGTGCTTGAGCTTGAGGTTGCGGC 3240 

yProValLysValGluValAsnGlnPro
3241 ACCAGTCAAGGTTGAAGTTAACCAACCTTAGGCCCAACAAGGAAGGCCCCTTCGAATCAA 3300 

Stop Computer predicted 

3301 GAAGGGGCCTTATTAGTGCAGCAATTATTCGCTGAACACGTGAACCTTACAGGTGCCCGG 3360 
translation, intermination point 

3361 CGCGTTGAGTGGTTTGAGTTCCAGCTGGATGCGGTTGTTTTCACCGAGGCTTTCTTGGAT 3420 

3421 GAATCCGGCGTGGAXGGCGCAGACGAAGGCTGATGGGCGTTTGTCGTTGACCACAAATGG 3480

3481 GCAGCTGTGTAGAGCGAGGGAGTTTGCTTCTTCGGTTTCGGTGGGGTCAAAGCCCATTTC 3540 

3541 GCGGAGGCGGTTAATGAGCGGGGAGAGGGCTTCGTCGAGTTCTTCGGCTTCGGCGTGGTT 3600

3601 AATGCCCATGACGTGTGCCCACTGGGTTCCGATGGAAAGTGCTTTGGCGCGGAGGTCGGG 3660 

3661 GTTGTTGCATTGCGTCATCGTCGAC 3685 
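
The amino acid lines printed above the nucleotide rows can be cross-checked by translating the reading frame with the standard genetic code. The following is a minimal sketch in Python and is not part of the original disclosure. The names `CODON_TABLE`, `THREE_LETTER`, and `translate` are hypothetical, and any codon containing an unreadable base (such as the X characters in the listing) is reported as `Xaa` rather than guessed.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter amino acid codes, as used in the figures.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna):
    """Translate dna from its first base, stopping at the first stop codon.

    Codons that are not plain A/C/G/T triplets are emitted as 'Xaa'.
    """
    dna = dna.upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino = CODON_TABLE.get(dna[pos:pos + 3])
        if amino is None:
            residues.append("Xaa")  # unreadable base in this codon
        elif amino == "*":
            break                   # stop codon ends the reading frame
        else:
            residues.append(THREE_LETTER[amino])
    return "".join(residues)


# First codons of the open reading frame that begins at the ATG in row 961.
print(translate("ATGACCTCAGCATCTGCCCCAAGCTTT"))
```

Run on the nine codons that follow the translation initiation codon, the sketch reproduces the printed residues MetThrSerAlaSerAlaProSerPhe. A disagreement between a translated row and its printed residue line points to a misread base or residue in that row.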
FIGURE 4

1 TGACCGAGAGTTTTTTTGAGCAACTGGATCATTAGATAATTGTTCGATCGACCGAATGAA 60

61 ATCACCCGTTATGGAGACCTACTGGAATTGAGCCCAGAAACCGTCGATGTGTGCCTCAAC 120 

121 GTAGGGGTAAAGCCACGGCCCGAGCAGCACCAGCCCGACCGCGAGCACCGAACAACCAAT 180 

181 GAGAACATACAGGTTCCACTTGGACACCGGCGCTGGATTAAGGATTTCAACTGCGGTGAG 240 

241 ATTCTTCTTGTTGTTGTCCTCGAGTTTCGAGAAGCTGGGGTAATCGGGAGCTGTCATCTT 300 

301 TAAAGCACATCCTAAAACCGACAATTGAAAGTGATCAGCAACACTTTAGGGTATCGCGTG 360 

Predicted start of thrC translation 

ValGlyGluTyrCysValThrProT 
361 GGCGAAGTCACCTTTTTCAACATATTTGAGACGGTGTGGGGGAGTATTGTGTCACCCCTT 420
ribosome binding sequence 

rpIleGlyLeuTyrProTrpThrThrPheArgProArgAspAlaSerArgThrProAlaA 
421 GGATAGGGTTATATCCGTGGACTACATTTCGACCGCGTGATGCCAGCCGTACCCCTGCCC 480

rgPheSerAspIleLeuLeuGlyGlyLeuAlaProAspGlyGlyLeuTyrLeuProAlaT 
481 GCTTCAGTGATATTTTGCTGGGCGGTCTAGCACCAGACGGCGGCCTGTACCTGCCTGCAA 540



hrTyrProGlnLeuAspAspAlaGlnLeuSerLysTrpArgGluValLeuAlaAsnGluG
541 CCTACCCTCAACTAGATGATGCCCAGCTGAGTAAATGGCGTGAGGTATTAGCCAACGAAG 600



lyTyrAlaAlaLeuAlaArgGluValIleSerLeuPheValAspAspIleProValGluA
601 GATACGCAGCTTTGGCTCGTGAAGTTATCTCCCTGTTTGTTGATGACATCCCAGTAGAAG 660 



spIleLysAlaIleThrAlaArgAlaTyrThrTyrProLysPheAsnSerGluAspIleV
661 ACATCAAGGCGATCACCGCACGCGCCTACACCTACCCGAAGTTCAACAGCGAAGACATCG 720 



alProValThrGluLeuGluAspAsnIleTyrLeuGlyHisLeuSerGluGlyProThrA
721 TTCCTGTCACCGAACTCGAGGACAACATTTACCTGGGCCACCTTTCCGAACCCGCAACCG 780 



laAlaPheLysAspMetAlaMetGlnLeuLeuGlyGluLeuPheGluTyrGluLeuArgA 
781 CTGCATTCAAAGACATGGCCATGCAGCTGCTCGGCGAACTTTTCGAATACGAGCTTCGCC 840



rgArgAsnGluThrIleAsnIleLeuGlyAlaThrSerGlyAspThrGlySerSerAlaG
841 GCCGCAACGAAACCATCAACATCCTGGGCGCTACCTCTGGCGATACCGGCTCCTCTGCGG 900



luTyrAlaMetArgGlyArgGluGlyIleArgValPheMetLeuThrProAlaGlyArgM
901 AATACGCCATGCGCGGCCGCGAGGGAATCCGCGTATTCATGCTGACCCCAGCTGGCCGCA 960



etThrProPheGlnGlnAlaGlnMetPheGlyLeuAspAspProAsnIlePheAsnIleA
961 TGACCCCATTCCAGCAAGCACAGATGTTTGGCCTTGACGATCCAAACATCTTCAACATCG 1020



laLeuAspGlyValPheAspAspCysGlnAspValValLysAlaValSerAlaAspAlaG
1021 CCCTCGACGGCGTTTTCGACGATTGCCAAGACGTAGTCAAGGCTGTCTCCGCCGACGCAG 1080



luPheLysLysAspAsnArgIleGlyAlaValAsnSerIleAsnTrpAlaArgLeuMetA
1081 AATTCAAAAAAGACAACCGCATCGGTGCCGTGAACTCCATCAACTGGGCACGCCTTATGG 1140



laGlnValValTyrTyrValSerSerTrpIleArgThrThrThrSerAsnAspGlnLysV
1141 CACAGGTTGTGTACTACGTTTCCTCATGGATCCGCACCACAACCAGCAATGACCAAAAGG 1200



alSerPheSerValProThrGlyAsnPheGlyAspIleCysAlaGlyHisIleAlaArgG 
1201 TCAGCTTCTCCGTACCAACCGGCAACTTCGGTGACATTTGCGCAGGCCACATCGCCCGCC 1260 



lnMetGlyLeuProIleAspArgLeuIleValAlaThrAsnGluAsnAspValLeuAspG
1261 AAATGGGACTTCCCATCGATCGCCTCATCGTGGCCAGCAACGAAAACGATGTGCTCGACG 1320 



luPhePheArgThrGlyAspTyrArgValArgSerSerAlaAspThrHisGluThrSerS 
1321 AGTTCTTCCGTACCGGCGACTACCGAGTCCGCAGCTCCGCAGACACCCACGAGACCTCCT 1380 



erProSerMetAspIleSerArgAlaSerAsnPheGluArgPheIlePheAspLeuLeuG
1381 CACCTTCGATGGATATCTCCCGCGCCTCCAACTTCGAGCGTTTCATCTTCGACCTGCTCG 1440 



lyArgAspAlaThrArgValAsnAspLeuPheGlyThrGlnValArgGlnGlyGlyPheS 
1441 GCCGCGACGCCACCCGCGTCAACGATCTATTTGGTACCCAGGTTCGCCAAGGCGGATTCT 1500



erLeuAlaAspAspAlaAsnPheGluLysAlaAlaAlaGluTyrGlyPheAlaSerGlyA 
1501 CACTGGCTGATGACGCCAACTTTGAGAAGGCTGCAGCAGAATACGGTTTCGCCTCCGGAC 1560



rgSerThrHisAlaAspArgValAlaThrIleAlaAspValHisSerArgLeuAspValL
1561 GATCCACCCATGCTGACCGTGTGGCAACCATCGCTGACGTGCATTCCCGCCTCGACGTAC 1620 



euIleAspProHisThrAlaAspGlyValHisValAlaArgGlnTrpArgAspGluValA
1621 TAATCGATCCCCACACCGCCGACGGCGTTCACGTGGCACGCCAGTGGAGGGACGAGGTCA 1680 



snThrProIleIleValLeuGluThrAlaLeuProValLysPheAlaAspThrIleValG
1681 ACACCCCAATCATCGTCCTAGAAACTGCACTCCCAGTGAAATTTGCCGACACCATCGTCG 1740 

luAlaIleGlyGluAlaProGlnThrProGluArgPheAlaAlaIleMetAspAlaProP
1741 AAGCAATTGGTGAAGCACCTCAAACTCCAGAGCGTTTCGCCGCGATCATGGATGCTCCAT 1800

heLysValSerAspLeuProAsnAspThrAspAlaValLysGlnTyrIleValAspAlaI
1801 TCAAGGTTTCCGACCTACCAAACGACACCGATGCAGTTAAGCAGTACATAGTCGATGCGA 1860

leAlaAsnThrSerValLys 
1861 TTGCAAACACTTCCGTGAAGTAACTTGCTTTACGCCAAGGCCTGATTCCTCTCTTTATGG 1920 

1921 GATGGAACCAGGCCTTTCGCATTGAGTGGCGTTTTAAGGCCTCCAATTCTTAGAACGGGT 1980 
Computer predicted terminator structure, *+termination point 

1981 GTTTGACATGGAGGGGTCACAGTCAAGCCGTTAGAAGCGATTCTGGGAGGGCAAGTTTTT 2040 

2041 CGGAGTTGGAGGTCGAATTTCCGCTGAACTGAXGGGAACCAGACAGGCGTGACAAGATTG 2100 

2101 GCTAAAAACCTGAAGTTTTGTCACGCCTGTCTGGTTTCCCTCTTGTCGGTGCGAGCGAGT 2160 

2161 CCCTTGAACGACACAGATCGCGCCAAATGGAAGTGTCTGCGACCCCAGAATATTTGATTC 2220 

2221 CCCGGTCCGAGTCGTGCGAAAAATGCTCTGGTTAGTCCTCGATCATCGCAATCGCATCAA 2280 

2281 TTTCCACAGTTGCACCATAAGGAAGCGATGATGCACCCACGAAAGAGCGTGCCGGGCGGC 2340 

2341 CTTCGAGGAAATGCTCTCGGAATTGCTCGTTGCATTCTTCGCGCAGGCTGATGTCGGTGA 2400 

2401 CAAAGTAAGTGAGTTTCACAACGTCTTTGAGTTCACCACCAGCGGTCGGAGGCGTTCACG 2460

2461 CATGCGTTCAAGTGCTGCATCAACTGCTTCTTTACGACCGACGACTGGTTGGTAGTCCTT 2520 
2521 GTCTACTGAAAGAGCGCCGGAGACGAAGATGAAATTTCCGACGCGTTTTGCGGGGGACTT 2580

2581 ATGGGTGATCATTCGACATGTGGCCAACCATAGCTGTTTCCCCGAAGAGAGTGCCGGAAC 2640 

2641 AGGCATTTTAGAGGTGGGGGAGGACTTCTTCGTAAATCTGGGTCAGTACTTCGCTTGCTG 2700

2701 GTCGGCGCTGGATGTTGAAGATGACGTGGTCGATGCCAAGTTCCGAAAGCGGTGGAGGTC 2760 

2761 TTCGATGAGTTCCTGGCTGCCTACCTCTACGCCAGAGTGATTTCTTTGTGGGTGTTTCCT 2820

2821 TCGGTGAGGTTGAGCCCCATGGAGGAAATCAACAAGGGGCGGGTGCCACCACGGGCTTTG 2880 

2881 TCCCAGAGATCGAGGCGTCCGACTTGAGCTTCAGCGGGGCGGTAGTAGGTTGCCCATCCG 2940 

2941 TCGGCGTTTCGGGCGATCCATTGCACTGTTTGTCGGGCAGAACCTACAGCGATCATGGGG 3000 

3001 ATCTGAGCTTCAGGTGGCGTGGTTGGCGCAAATTCAAGGTCGGCCCGCATCGCAGGATCC 3060

3061 TTCGACAAAGCTGCACGCAAAATTGCCCACCCAGACTGAATATCAGCGCGTCGATTGTCT 3120 

3121 AAGCTTTTCGGAAAAATCTCGAATTC 3146 
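
Each row of the listing carries a start coordinate, a run of bases, and an end coordinate, so the numbering can be checked mechanically: the end coordinate should equal the start coordinate plus the number of bases, minus one. The sketch below is an illustrative Python check under that assumption and is not part of the original disclosure. The names `ROW` and `check_row` are hypothetical.

```python
import re

# A row is assumed to look like: "<start> <bases> <end>".
ROW = re.compile(r"^\s*(\d+)\s+([A-Za-z]+)\s+(\d+)\s*$")


def check_row(line):
    """Return True if the coordinates agree with the number of bases,
    False if they disagree, and None if the line is not a sequence row."""
    match = ROW.match(line)
    if match is None:
        return None
    start, bases, end = int(match.group(1)), match.group(2), int(match.group(3))
    return end - start + 1 == len(bases)


# Final row of the listing above: 26 bases numbered 3121 through 3146.
print(check_row("3121 AAGCTTTTCGGAAAAATCTCGAATTC 3146"))
```

A row whose start coordinate has been split by a stray space does not match the pattern and returns `None`, which flags it for manual inspection rather than silently passing.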
-10, mRNA start, Met, hyphenated dyad symmetry

GTCTATAGTGGCTAGGTACCCTTTTTGTTTTGGACACATGTAGGGTGGCCGAAACAAAGT
CAGATATCACCGATCCATGGGAAAAACAAAACCTGTGTACATCCCACCGGCTTTGTTTCA 

predicted start of thrA translation 

ThrSerAlaSerAla 

AATAGGACAACAACGCTCGACCGCGATTATTTTTGGAGAATCATGACCTCAGCATCTGCC
TTATCCTGTTGTTGCGAGCTGGCGCTAATAAAAACCTCTTAGTACTGGAGTCGTAGACGG 
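
The two-strand listing above pairs each base of the upper strand with its complement on the lower strand (A with T, C with G), written in the same left-to-right order. A short Python sketch, offered only as an illustration and not as part of the original disclosure, shows how such a pair of lines can be compared position by position. The names `complement`, `reverse_complement`, and `mismatches` are hypothetical.

```python
# Watson-Crick pairing for unambiguous DNA bases; other characters
# are left unchanged by str.translate.
_PAIRS = str.maketrans("ACGT", "TGCA")


def complement(seq):
    """Return the base-by-base complement, in the same order as seq."""
    return seq.upper().translate(_PAIRS)


def reverse_complement(seq):
    """Return the complementary strand read in its own 5' to 3' direction."""
    return complement(seq)[::-1]


def mismatches(top, bottom):
    """Return 1-based positions where bottom is not the complement of top."""
    return [i + 1
            for i, (t, b) in enumerate(zip(top.upper(), bottom.upper()))
            if complement(t) != b]


# Segment spanning the predicted translation start, with the lower strand
# segment printed beneath it in the listing.
top = "GGAGAATCATGACCTCAGCATCTGCC"
bottom = "CCTCTTAGTACTGGAGTCGTAGACGG"
print(mismatches(top, bottom))
```

For the segment shown, the list of mismatching positions is empty. A non-empty list identifies positions at which one of the two strands was probably misread.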